A Linguistically Motivated MT Evaluation System Based on SVM Regression

Authors

  • Muyun Yang
  • Shuqi Sun
  • Jufeng Li
  • Sheng Li
Abstract

This paper describes the automatic MT evaluation system developed by the Machine Intelligence and Translation Lab of Harbin Institute of Technology for the NIST MetricsMATR 2008 evaluation. The system is based on SVM regression and employs many linguistic features. Machine-learning-based automatic MT evaluation has recently gained attention in NLP research, and feature mining is essential to such approaches. In this work, a novel feature, letter-based BLEU, is introduced; it correlates well with human assessments and contributes to the system’s final performance.
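The abstract does not define letter-based BLEU, so the following is only a minimal sketch under stated assumptions: that the feature is ordinary BLEU computed over character n-grams instead of word n-grams, with whitespace removed, a single reference, n-grams up to length 4, and no smoothing. The function names `char_ngrams` and `letter_bleu` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def char_ngrams(text, n):
    """Return a Counter of character n-grams (assumption: whitespace is dropped)."""
    letters = "".join(text.split())
    return Counter(letters[i:i + n] for i in range(len(letters) - n + 1))


def letter_bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU over characters (single reference, no smoothing).

    This is an illustrative guess at a letter-based BLEU, not the authors' code.
    """
    cand_len = len("".join(candidate.split()))
    ref_len = len("".join(reference.split()))
    if cand_len == 0:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand = char_ngrams(candidate, n)
        ref = char_ngrams(reference, n)
        total = sum(cand.values())
        # Clipped matches: a candidate n-gram counts at most as often
        # as it occurs in the reference.
        matched = sum(min(count, ref[gram]) for gram, count in cand.items())
        if total == 0 or matched == 0:
            return 0.0
        log_precisions.append(math.log(matched / total))

    # Standard BLEU brevity penalty, applied to character lengths.
    if cand_len > ref_len:
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - ref_len / cand_len)
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    print(letter_bleu("the cat sat", "the cat sits"))
```

In a regression-based system of the kind the abstract describes, a score like this would be one feature among several rather than the final metric.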

Similar resources

All in Strings: a Powerful String-based Automatic MT Evaluation Metric with Multiple Granularities

String-based metrics for automatic machine translation (MT) evaluation are widely applied in MT research. Meanwhile, some linguistically motivated metrics have been suggested to improve on string-based metrics in sentence-level evaluation. In this work, we attempt to change the original calculation units (granularities) of string-based metrics to generate new features. We then propose a powerful s...

A Re-examination on Features in Regression Based Approach to Automatic MT Evaluation

Machine learning methods have been extensively employed in developing MT evaluation metrics, and several studies show that they can help to achieve a better correlation with human assessments. Adopting the regression SVM framework, this paper discusses the linguistically motivated feature formulation strategy. We argue that “blind” combination of available features does not yield a general metrics wit...

Diagnostic Evaluation of Machine Translation Systems Using Automatically Constructed Linguistic Check-Points

We present a diagnostic evaluation platform which provides multi-factored evaluation based on automatically constructed check-points. A check-point is a linguistically motivated unit (e.g. an ambiguous word, a noun phrase, a verb~obj collocation, a prepositional phrase, etc.) that is pre-defined in a linguistic taxonomy. We present a method that automatically extracts check-points from parall...

Developing Knowledge Bases for MT with Linguistically Motivated Quality-Based Learning

In this paper we present a proposal to help bypass the bottleneck of knowledge-based systems working under the assumption that the knowledge sources are complete. We show how to create, on the fly, new lexicon entries using lexico-semantic rules and how to create new concepts for unknown words, investigating a new linguistically-motivated model to trigger concepts in context.

A SVM Regression Based Skip-Ngram Approach to MT Evaluation

This paper describes an automatic MT evaluation metric named SNR, developed by the Machine Intelligence and Translation Lab of Harbin Institute of Technology for the NIST MetricsMATR 2008 evaluation. The metric extends the idea of skip-bigram with a larger span and multiple statistics. An SVM regression method is adopted to tune the weights of the statistics in the metric. The experimental results show that SNR correlate...


Publication date: 2008